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Abstract 

This paper presents no new results; its goals are purely pedagogical. A special case of 
the Cartan Decomposition has found much utility in the field of quantum computing, 
especially in its sub-field of quantum compiling. This special case allows one to factor 
a general 2-qubit operation (i.e., an element of U(4)) into local operations applied 
before and after a three parameter, non-local operation. In this paper, we give a 
complete and rigorous proof of this special case of Cartan's Decomposition. From 
the point of view of QC programmers who might not be familiar with the subtleties 
of Lie Group Theory, the proof given here has the virtues, that it is constructive in 
nature, and that it uses only Linear Algebra. The constructive proof presented in 
this paper is implemented in some Octave/Matlab m- files that are included with the 
paper. Thus, this paper serves as documentation for the attached m-files. 



1 Introduction and Motivation 

Cartan's KAK Decomposition was discovered by the awesome mathematical genius, 
Elie Cartan (1869-1951). Henceforth, for succinctness, we will refer to his decomposi- 
tion merely as KAK. The letters KAK come from the fact that in stating and proving 
KAK, one considers a group G = exp(g) with a subgroup K_ = exp(k) and a Cartan 
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subalgebra a, where g = k © k and a C k . Then one shows that any GeG can 
be expressed as G = KiAK 2 , where Ki,K 2 G K_ and A G exp(a). An authoritative 
discussion of KAK can be found in the book by HelgasonpQ. 

KAK was first applied to quantum computing (QC) by Khaneja and Glaser 
in Refs. Since we are using "KAK" to refer to the general theorem, we will use 
"KAK1" to refer to the special case of KAK used by Khaneja and Glaser. Besides 
KAK1, the Cosine-Sine Decomposition (CSD)j3]@] is another decomposition that is 
very useful|5J in QC. After Refs.@ and |1J, QC workers came to the realization |J] that 
CSD also follows from KAK , even though CSD was discovered jl] quite independently 
from KAK. 

This paper will only discuss KAK1. KAK1 is the assertion that: Given any 
U G SU(4:), one can find Ai, A Q , Bx, B G SU(2) and k G M 3 so that 

U=(A 1 ®A )e itS (B 1 ®B ) , (1) 

where £ is an operator that is independent of U and will be defined later. Thus KAK1 
parameterizes SU(4), a 15-parameter Lie Group, so that 12 parameters characterize 
local operations, and only 3 parameters (the 3 components of k) characterize non-local 
ones. 

Ever since Refs.|2j appeared, many workers other than Khaneja and Glaser 
have used KAK1 in QC to great advantage (see, for example, Refs. j7], [H], 0). Mainly, 
they have used KAK1 to compile 2-qubit operations. For instance, Vidal and Dawson 
used KAK1 to prove that any 2-qubit operation can be expressed with 3 or fewer 
CNOTs and some 1-qubit rotations. 

This paper includes a complete, rigorous proof of KAK1 and related theo- 
rems. The proof of KAK1 presented here is based on the well known isomorphism 
5*0(4) = nf|^r^^| an d on a theorem by Eckart and Young (EY)[TU]. The EY the- 
orem gives necessary and sufficient conditions for simultaneous SVD (singular value 
decomposition) of two matrices. The relevance of the EY theorem to KAK1 was 
pointed out in Ref.|llj. The proof of KAK1 given here is a constructive proof, and it 
uses only Linear Algebra. Contrast this to the proof of KAK given in Ref. pQ, which, 
although much more general, is a non-constructive ("existence") proof, and it uses 
advanced concepts in Lie Group Theory. 

Octave is a programming environment and language that is gratis and open 
software. It copies most of Matlab's function names and capabilities in Linear Al- 
gebra. A collection of Octave/ Mat lab m-files that implement the algorithms in this 
paper, can be found at ArXiv (under the "source" for this paper), and at my website 
(www.ar-tiste.com). 
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2 Notation and Other Preliminaries 



In this section, we will define some notation that is used throughout this paper. For 
additional information about our notation, see Ref . j!2j . 

We will use the word "ditto" to mean likewise and respectively. For example, 
u x (ditto, y) is in A (ditto, £?)", means x is in A and y is in B. 

As usual, K, C will stand for the real and complex numbers. For any complex 
matrix A, the symbols A*,A T ,A^ will stand for the complex conjugate, transpose, 
and Hermitian conjugate, respectively, of A. (Hermitian conjugate a.k.a. conjugate 
transpose and adjoint) 

The Pauli matrices are defined by: 
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They satisfy 



a Y 



a x cr Y 







-OyOx = U?Z 



(2) 



(3) 



and the two other equations obtained from this one by permuting the indices (x,y, z) 
cyclically. We will also have occasion to use the operator a, defined by: 

a = (ax,cr Y ,crz) ■ (4) 

Let ax,, for fj, £ {0,1,2,3} be defined by a Xo — &i — h, where I 2 is the 2 
dimensional identity matrix, a Xl = &x, &x 2 — a Y, and ax A = &z- Now define 



for fM,p £ {0, 1, 2, 3}. For example, oxy = <Jx ® <Jy and a\x 



(5) 



l2®&x- The matrices 
ox^x v satisfy 

(JxxOyy = &YY&XX = -o zz , (6) 

and the two other equations obtained from this one by permuting the indices (x, y, z) 
cyclically. We will also have occasion to use the operator S, defined by: 



{&XX, &YY, OZZ) 



Define 



M 



1 
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(8) 



It is easy to check that M. is a unitary matrix. The columns of M. are an orthonormal 
basis, often called the "magic basis" in the quantum computing literature. (That's 
why we have chosen to call this matrix Ai, because of the "m" in magic). 
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In this paper, we often need to find the outcome Ai^XAi (or AiXAi^) of a 
similarity transformation ( equivalent to a change of basis) of a matrix X G C 4x4 
with respect to M.. Since X can always be expressed as a linear combination of 
the ox^Xv-, h is useful to know the outcomes M.\<Jx ll x v )M- (or M(o Xll x v )^) for 
H, v e {0, 1, 2, 3}. One finds the following two tables: 



M\A®B)M 



M(A®B)M^ 
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3 Proof of KAK1 



In this section, we present a proof of KAK1 and related theorems. The proofs are 
constructive in nature and yield the algorithms used in our software for calculating 
KAK1. Thus, even those persons that are not too enamored with mathematical proofs 
may benefit from reading this section. 

Theorem 1 Define a map <f> by 

: SU(2) -> SO(3) , (j)(A) = M\A®A*)M . (11) 

Then (ft is a well defined, onto, 2-1, homomorphism. Well-defined: For all A e SU{2), 
M*(A® A*)M E SO(3). Onto: For all Q e SO(3), there exist A £ SU(2) such that 
Q = JA^(A<g) A*)M.. 2-1: 4> maps exactly two elements (A and —A) into one (<f>(A)). 
Homomorphism: <p preserves group operations. 

Theorem 2 Define a map $ by 

$ : SU{2) x SU(2) -> SO (A) , B) = M\A <g> B*)M . (12) 

Then $ is a well defined, onto, 2-1, homomorphism. Well-defined: For all A,B£ 
SU{2), M\A®B*)M G SO(A). Onto: For allQ e SO(A), there exist A, B e SU{2) 
such that Q = M.\A <g> B*)A4. 2-1: <fi maps exactly two elements ((A, B) and 
(—A,—B)) into one (<f>(A,B)). Homomorphism: $ preserves group operations. 
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Theorems [0 and |2 are proven in most modern treatises on quaternions, albeit 
using a different language, the language of quaternions. See Version 2 or higher of 
Ref . ^2] ) for proofs of Theorems Q and 121 given in the language favored here and 
within the quantum computing community. 



x+x* v _ x-x* 



Lemma 3 Suppose X is a unitary matrix and define Xr = — ^ — , Xi - . 
( X X \ 

Then Q = I ) is an orthogonal matrix. Furthermore, Xr and Xj are real 

\ —Xi X R J 

matrices satisfying XrX r + XjXf = X r Xr + XfXj = 1. Furthermore, XjX R and 
XJXr are both real, symmetric matrices. 

proof: 

1 = XX* = (X R + iXj){X T R - iXj) , (13) 



so 



and 



From 1 = 1*1 we also get 



and 



X R X* + X I Xf = l, (14a) 

XjX T R - X R Xj = . (14b) 
X R \X R + XjX I = l, (15a) 



XjX R - X T R X! = . (15b) 

Note that Eqs.(jHJ) and Eqs. fTHjl are identical except that in Eqs.dHJ), the second ma- 
trix of each product is transposed, whereas in Eqs. (jl5j) . the first is. Q is clearly a real 
matrix, and Eqs.(|14|) imply that its columns are orthonormal. Hence Q is orthogonal. 
Eq. (jl4bj) (ditto, Eq. ()15bj) ) implies that (ditto, XjX R ) is symmetric. 

QED 

The next theorem, due to Eckart and Young, gives necessary and sufficient 
conditions for finding a pair of unitary matrices U, V that simultaneously accomplish 
the SVD (singular value decomposition) of two same-sized but otherwise arbitrary 
matrices A and B. The proof reveals that the problem of finding simultaneous SVD's 
can be reduced to the simpler problem of finding simultaneous diagonalizations of 
two commuting Hermitian matrices. The problem of simultaneously diagonalizing 
two commuting Hermitian operators (a.k.a. observables) is well known to physicists 
from their study of Quantum Mechanics. 

Theorem 4 (Eckart- Young) Suppose A,B are two complex (ditto, real) rectangular 
matrices of the same size. There exist two unitary (ditto, orthogonal) matrices U,V 
such that Di = U^AV and D 2 = U^BV are both real diagonal matrices if and only if 
AB^ and A^B are Hermitian (ditto, real symmetric) matrices. 
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proof: 

(=0 AB* = UD x D 2 lfl and A*B = VD 1 D 2 V^ so they are Hermitian. 
(«=)Let 



be a SVD of A. Thus, Ua,Va are unitary matrices, 02,0 3 ,0 4 are zero matrices, and 
D is a square diagonal matrix whose diagonal elements are strictly positive. Let 

B> = U\BV A =( G T K u ) , (17) 



where D and G are square matrices of the same dimension, rank(A). Note that 

A'B'> = B'A't ^ | (1! 

and 



) \ LD 



Therefore, 



and 



L = K = 0, (20) 



DG t = , L>G = G^D . (21) 
When written in index notation, Eqs. (j21j) become 

d i9ji = dijdj , digij = g^d, , (22) 
where the indices range over {1,2,..., rank(A)}. Eqs. (j22|) imply 

{d i + d j ){g* i -g ij ) = Q. (23) 

Since di > 0, we conclude that G is a Hermitian matrix. D is Hermitian too, and, by 
virtue of Eq. (j21j) . D and G commute. Thus, these two commuting observables can 
be diagonalized simultaneously. Let P be a unitary matrix that accomplishes this 
diagonalization: 

D = P ] DP, D G = P ] GP. (24) 

Let 

D H = U ] H HV H (25) 
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be a SVD of H. Dh is a diagonal matrix with non-negative diagonal entries and 
Uh , Vh are unitary matrices. Now let 



pt 

o f/i 



P 
Vff 



(26) 



The matrices U and y defined by Eq.(|26|) can be taken to be the matrices U and V 

defined in the statement of the theorem. 

QED 

Corollary 5 If X is a unitary matrix, then there exist orthogonal matrices Ql and 
Qr and a diagonal unitary matrix e l@ such that X = Ql^Q T r- 

proof: Let Xr and Xj be defined as in Lemma EJ According to Lemma 01 XjX^ 
and XJXr are real symmetric matrices, so we can apply Theorem 0] with A = Xr 
and B = Xj. Thus, there exist orthogonal matrices Qr and Ql such that 

Dr = Q t l X r Qr, Dj = Q t l XjQr, (27) 

where Dr, Dj are real diagonal matrices. Since X is unitary, Dr + iDj is too. Thus, 
we can define a diagonal unitary matrix e l& by 



AO 



Dr + iDj . 



Combining Eqs.(|2T)l and 



finally yields 

e ie = Q t l XQr 



(2f 



(29) 



QED 



Let t 



&2,03, 8i) e M 4 and 9 = diag(t) so that 



e l@ = diag{e id \e ld \e ld \e idi ) . 
Let (k , k) e R 4 . According to Eq.©, 



If we set 



(30) 



(31) 



e ie = M ] e i{ko+tt) M , (32) 
then each point (#i, #2, #3, #4) £ is mapped in a 1-1 onto fashion into each point 
(ho, k) G M 4 . Using the explicit forms of o~zi, (?iz, o~zz, one finds that 



( k \ 

h 

\k 3 J 



where T 



( +1 
+1 
+1 

V+ 1 



■1 +1\ 

-1 -1 

■1 -1 

-1 +1 / 



(33) 
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It is easy to check that 



r- 1 = T • (34) 

Corollary 6 (KAK1) If X G U(A), then X = (Ai ® A )e i ( fco+ ^^)(^i ® #o), ^ere 
A x , A , B u B G SU(2) and (k , k) G M 4 . 

proof: Let 

X' = M ] XM . (35) 

X' is a unitary matrix, so, according to ColloraryEJ we can find orthogonal matrices 
Ql, Qr and a diagonal unitary matrix e* e such that 

X' = Q L e ie Q T R . (36) 
According to Theorem we can find Ai, Aq, Bi, Bq G SU(2) such that 

MQ L M^ = A 1 ®A , (37) 

and 

MQb,M ] = B x ® 5 . (38) 

As in Eq.(JHD, set 



Ate^A** = e l(ko+k ^ . (39) 

It follows that 

X = {A 1 ® A )e l(ko+ ^ (B 1 <g> So) . (40) 



QED 



4 Canonical Class Vector 

In this section we discuss how KAK1 partitions 57/(4) into disjoint classes character- 
ized by a 3d real vector k. 

We will say that U, V G /Si/ (4) are equivalent up to local operations and 
write U ~ V if t/ = {R x ® i2o)V(Si <8> 5 ) where R u Rq,Si,Sq G i/(2). It is easy 
to prove that ~ is an equivalence relation. Hence, it partitions SU(A) into disjoint 
subsets (i.e., equivalence classes). If X G Si/ (4) and A; G M 3 are related as in Collorary 
El then X ~ e lfc S . Henceforth, we will call this k a class vector of X. We will say 
that k! and k are equivalent class vectors and write k' ~ k if e lfc ' E ~ e lfc E . 

Note that the following 3 operations map a class vector into another class 
vector of the same class; i.e., the operations are class-preserving. 
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1 . (Shift) Suppose we shift k by plus or minus f along any one of its 3 components. 
For example, a positive, f , X-shift would map 

7T 

{k x , ky, kz) i > (k x + — , ky, kz) ■ (41) 
This operation preserves Fs class because 

e i[(k x + §)(T XX +kyaYY+kzCTzz} _ ^^xx^k-t _ i(X xx ^'^ . (42) 

2. (Reverse) Suppose we reverse the sign of any two components of k. For example, 
an XY-reversal would map 

[k x , ky, k z ) i ► ( k x , ky, k z ) . (43) 
This operation preserves &'s class because 

a zl e ik '^a zl = e i(.-k*,-kv,k,yn _ ^ 

— * 

3. (Swap) Suppose we swap any two components of k. For example, an XY-swap 
would map 

{k x , k y , k z ) i— > (ky, k x , kz) ■ (45) 
This operation preserves fc's class because 

g-»f (o-zi+CTiz) e ifc-S e iJ (cr zl +cr iz ) _ e i(k y ,k x ,k z )-Y, ^ /^g\ 

Define /C as the set of points fcel 3 such that 

1. ^> k x >k y >k z >0 

2. fcjj + fo^ < f 

3. If fc, = 0, then ^ < f . 

/C is contained within the tetrahedral region OA1A2A3 of FigQ 

The 3 class-preserving operations given above generate a group W_. Given 
any class vector k G M 3 , it is always possible to find an operation G G W_ such that 
G(k) G /C. Indeed, here is an algorithm, (implemented in the accompanying Octave 
software) that finds G(k) G K, for any k G M 3 : 

1. Make k x G [Of) by shifting k x repeatedly by f . In the same way, shift k y and 
k z into [Of). 
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71/2 



Figure 1: The set /C of canonical class vectors equals the set of points in the tetra- 
hedral region OAiA 2 A 3 (including all the interior and the surface points except that 
only the half OBA 2 of the base OA 1 A 2 is included). 



2. Make k x > k y > k z by swapping the components of k. 

3. Perform this step iff at this point k x +k y > |. Transform k into {^—k y ,j—k x ,k z ) 
(This can be achieved by applying an XY-swap, XY-reverse, X-shift and Y-shift, 
in that order). At this point, k x > k y , but k z may be larger than k y or k x , so 
finish this step by swapping coordinates until k x > k y > k z again. 

4. Perform this step iff at this point k z = and k x > |. Transform k = (k x , k y , 0) 
into (| — k x ,k y ,0) (This can be achieved by applying an XZ-reverse and an 
X-shift, in that order). 

We can find a subset S of R 3 such that every equivalence class of SU (4) is 
represented by one and only one point k of S. In fact, K defined above is one such 
S. We will refer to the elements of /C as canonical class vectors. 

We end this section by finding the canonical class vectors of some simple 2- 
qubit operations. 



1. (CNOT): CNOT{\ -> 0) is defined by 



CXOTi ] - 0) fT V (0) n(1) = ( J ff ° ) • (17) 
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Since n = \{1 - a z ), n x = |(1 - a x ), and a x = (-l) n * = e i7Tnx , 



a x (0) n(1) = (_l)"*(o)n(i) ( 48a ) 

= e *f (i-cix)(i-o- Z i) (48b) 

= gi|(l-<rxi-<Tix)gi|^x ( 48c ) 

— e *f (l-o"Zi-CTix) e «f o-yigif o-xjfg-if o-yi (48d) 

~ e^ axx . (48e) 

Therefore, the canonical class vector of CNOT is (j, 0, 0), which corresponds to 
the point B in FigQ 



2. (V CNOT) From the math just performed for CNOT, it is clear that 

y / a x (0) n ( 1 ) = e *f(i-*i*)(i-<Tzi) ^ e *W* _ (49) 

Therefore, the canonical class vector of v CNOT is (|, 0, 0), which corresponds 
to the midpoint of the segment OS in Fig^ 

3. (Exchanger, a.k.a. Swapper) As usual, the Exchanger is defined by 



E 



/ 1 \ 
10 
10 
\ 1 / 

(Note that det(E) = — 1). Using Eqs.©, it is easy to show that 



(50) 



= g-*f g*f (VXX+VYY+VZZ) _ (52) 

Therefore, the canonical class vector of e l ^E is (|, |, |), which corresponds to 
the apex v4 3 of the tetrahedron in Fig^ 



5 Software 

A collection of Octave/Matlab m-files that implement the algorithms in this pa- 
per, can be found at ArXiv (under the "source" for this paper), and at my website 
(www.ar-tiste.com). These m-files have only been tested on Octave, but they should 
run on Matlab with few or no modifications. A file called "m-fun-index.html" that 
accompanies the m-files lists each function and its purpose. 
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